Document processing apparatus and document processing method

ABSTRACT

A document processing apparatus has a language tag setting section for setting a language tag designating a kind of language at each of constituent unit positions in an original and those in a translated version corresponding to the original, and a document forming section for forming a tagged document including the original and the translated version each having the language tag set therein. This apparatus enables a user to easily form and maintain a home page (Web page data) expressed in a plurality of languages.

BACKGROUND OF THE INVENTION

[0001] The present invention relates to a document processing apparatusand method which analyzes a tagged document, e.g., a Hyper Text MarkupLanguage (HTML) document and forms another tagged document containingoriginal sentences and translated sentences. The present invention alsorelates to a recording medium for recording such a tagged document.

[0002] With the recent proliferation of personal computers andcommunication apparatus, people have become able to use communicationnetworks represented by the Internet, i.e., Internet protocol (IP)communication networks, and to easily obtain various sorts ofinformation through the networks. Ordinarily, the World Wide Web in theInternet generally uses HTML as a language for describing information.Dynamic HTML (DHTML) and Extensible Markup Language (XML) are otherlanguages presently used to form tagged documents.

[0003] Conventionally, to form home pages (also called Web pages)containing the same information described in different languages, e.g.,English and Japanese, the process of separately forming each home pageis required. That is, the steps of forming sentences separately in eachlanguage, pasting common images including graphs and figures, separatelysetting links from each of the English home page and the Japanese homepage, etc., are required.

[0004] Home pages are open to the public on the Internet and can be readby people everywhere in the world. Therefore, people who set up theirhome pages for various readers to read are making home pages having thesame format and contents but having sentences written in differentlanguages.

[0005] The English and Japanese home pages formed as described aboveneed scrupulous attention for maintenance because their English andJapanese sentences after correction must be equivalent in meaning andformat.

[0006] To form such English and Japanese home pages, it is necessary tomake and manage two kinds of HTML document files for the English homepage and the Japanese home page. That is, the number of files to bemanaged is increased in proportion to the number of languages, andmanagement and maintenance becomes difficult.

[0007] A user who wishes to read document information in an English homepage in Japanese may translate the English document information by usingInternet translation software. However, if the user wishes to edit theresults of the translation displayed as an HTML document, he or she mustgive up the idea of doing so or is further required to translate thetranslation results by different translation software, because directediting of the translated HTML document is impossible.

[0008] If the user dares to edit the translation-result HTML document,he or she must perform the steps of storing the translation-result HTMLdocument on a local disk, opening the HTML document file stored on thelocal disk by using HTML document editing software, displaying the HTMLdocument source, directly editing the HTML document source, and storingthe results of the editing on the local disk. This process enablesediting of the translation results to some effect. However, it isdifficult to edit a document in which HTML tags, original sentences, andtranslated sentences are mixed.

[0009] Further, in a case where an HTML document intended as an objectof translation is prepared in advance and, from this document, anotherHTML document described in a different language is formed by translationprocessing using Internet translation software, a need may arise to editthe HTML document in the second language formed by the translationprocessing and, if necessary, the translation-object HTML document, ifthe author of the HTML document is not satisfied with the results of thetranslation.

[0010] In this editing, it is difficult to determine document portionsto be edited and to confirm the correspondence between original andtranslated sentences, since the translation-object HTML document and thetranslation-result HTML document exist in separate files. It is alsopossible that, through editing, the page configuration (format) of onedocument will become different from that of the other.

[0011] As described above, the conventional HTML document processingapparatus can be designed to enable translation of an original home pageon the Internet using Internet translation software and visual displayof original and translated sentences in a juxtaposed form. However, inediting translation results, an HTML document itself cannot be edited.There is a way to directly edit the HTML document source, but editing insuch a way is extremely troublesome and not satisfactorily effective.

SUMMARY OF THE INVENTION

[0012] In view of the above-described circumstances, an object of thepresent invention is to provide a document processing apparatus and arecording medium which make it possible to easily form and maintain ahome page (Web page data) expressed in two or more languages.

[0013] Another object of the present invention is to provide a documentprocessing apparatus and a recording medium which make it possible toeasily edit translated sentences obtained as a result of translation ofa tagged document.

[0014] Still another object of the present invention is to provide adocument processing apparatus and a recording medium which make itpossible to selectively display original sentences in a tagged documentand translated sentences obtained as a result of translation of thetagged document.

[0015] A further object of the present invention is to provide adocument processing apparatus and a recording medium which make itpossible to return the edited translation sentences of the result oftranslation into a tagged document and to use it.

[0016] A further object of the present invention is to provide adocument processing apparatus and a recording medium which make itpossible to easily edit original sentences in a tagged document to froma more favorable translation.

[0017] To achieve the above-described objects, according to one aspectof the present invention, there is provided a document processingapparatus comprising a language tag setting unit for setting a languagetag designating a kind of language at each of constituent unit positionsin an original and those in a translated version corresponding to theoriginal, and a document forming unit for forming a tagged documentincluding the original and the translated version each having thelanguage tag set therein.

[0018] The above-described document forming unit may form a taggeddocument in which the original and the translated version each havingthe language tag set therein are described in an original-versus-versionform.

[0019] The above-described document processing apparatus furthercomprises a translation unit for translating the original to make thecorresponding translated version.

[0020] The original may be contained in the processing-object taggeddocument.

[0021] The above-described document processing apparatus furthercomprises a visibility control tag setting unit for setting a visibilitycontrol tag for controlling any document portion so that the documentportion is invisible, the visibility control tag setting unit settingthe visibility control tag at such a position that one of the originaland the translated version is in an invisible state.

[0022] The visibility control tag setting unit may set the visibilitycontrol tag at such a position that one of the original and thetranslated version is set in an invisible state and the language tag isalso set in an invisible state.

[0023] The above-described document processing apparatus furthercomprises a display processing unit for interpreting the tag containedin the tagged document and for enabling a browser for displaying thetagged document to display the formed tagged document on the basis of aresult of the interpretation in the above document processing apparatus.

[0024] According to another aspect of the present invention, there isprovided a document processing apparatus comprising a language tagsetting unit for setting a language tag at each of constituent unitpositions in a first text described in a first language and those in asecond text described in a second language of the first text, and adocument forming unit for forming a tagged document including the firsttext and the second text each having the language tag set therein.

[0025] The above-described document forming unit may form a taggeddocument in which the first text and the corresponding second text eachhaving the language tag set therein are described by being related toeach other.

[0026] The above-described document processing apparatus furthercomprises a visibility control tag setting unit for setting a visibilitycontrol tag for controlling a text so that the text is invisible, thevisibility control tag setting unit setting the visibility control tagat such a position that one of the first text and the second text is inan invisible state.

[0027] The visibility tag setting unit may set the visibility controltag at such a position that one of the first text and the second text isset in an invisible state and the language tag is also set in aninvisible state.

[0028] The above-described document processing apparatus furthercomprises a display processing unit for interpreting the tag containedin the tagged document and for enabling a browser for displaying thetagged document to display the formed tagged document on the basis of aresult of the interpretation.

[0029] According to still another aspect of the present invention, thereis provided a document processing apparatus comprising an analysis unitfor determining and extracting an original text from a processing-objecttagged document, and an editing unit for enabling edit processing of theoriginal text extracted from the tagged document by displaying theoriginal text.

[0030] The analysis unit may determine the original text according to alanguage tag contained in the processing-object tagged document.

[0031] The above-described document processing apparatus furthercomprises a translation processing unit for making a translated versionby translating the original text edited by the editing unit.

[0032] The above-described document processing apparatus furthercomprises a restoration unit for restoring, from the original text afterthe editing and the translated version made by the translationprocessing unit, a tagged document in the same format as theprocessing-object tagged document according to tags contained in theprocessing-object tagged document, the tags including the language tag.

[0033] According to yet another aspect of the present invention, thereis provided a document processing apparatus comprising an analysis unitfor determining an original text and a translated version correspondingto the original text in a processing-object tagged document, and anediting unit for enabling edit processing of the original text and thetranslated version by extracting the original text and the translatedversion from the tagged document and by displaying the original text andthe translated version in an original-versus-version form.

[0034] The analysis unit may determine the original text and thetranslated version according to language tags contained in theprocessing-object tagged document.

[0035] The above-described document processing apparatus furthercomprises a translation processing unit for making a translated versionby translating the original text edited by the editing unit.

[0036] The above-described document processing apparatus furthercomprises a restoration unit for restoring, from the original text afterthe editing and the translated version made by the translationprocessing unit, a tagged document in the same format as theprocessing-object tagged document according to tags contained in theprocessing-object tagged document, the tags including the language tag.

[0037] According to a further aspect of the present invention, there isprovided a recording medium readable by a computer, the recording mediumhaving a program recorded thereon, the program for enabling the computerto execute a step of setting a language tag designating a kind oflanguage at each of constituent unit positions in an original and thosein a translated version corresponding to the original, and a step offorming a tagged document including the original and the translatedversion each having the language tag set therein.

[0038] The program recorded on this recording medium may enable thecomputer to further execute a step of setting a visibility control tagfor controlling any document portion so that the document portion isinvisible, the visibility control tag being set at such a position thatone of the original and the translated version is in an invisible state.

[0039] According to still a further aspect of the present invention,there is provided a recording medium readable by a computer, therecording medium having a program recorded thereon, the program forenabling the computer to execute a step of setting a language tagdesignating a kind of language at each of constituent unit positions ina first text described in a first language and those in a second textdescribed in a second language of the first text, and a step of forminga tagged document including the first text and the second text eachhaving the language tag set therein.

[0040] The program recorded on this recording medium may enable thecomputer to further execute a step of setting a visibility control tagfor controlling a text so that the document is invisible, the visibilitycontrol tag being set at such a position that one of the first text andthe second text is in an invisible state.

[0041] According to still a further aspect of the present invention,there is provided a recording medium readable by a computer, therecording medium having a program recorded thereon, the program forenabling the computer to execute a step of determining and extracting anoriginal text from a processing-object tagged document, and a step ofenabling editing of the text extracted from the tagged document bydisplaying the original text.

[0042] The program recorded on this recording medium may enable thecomputer to further execute a step of making a translated version bytranslating the original text after editing of the original text, and astep of restoring, from the original text after the editing and thetranslated version made by the translation, a tagged document in thesame format as the processing-object tagged document according to tagscontained in the processing-object tagged document, the tags including alanguage tag designating a kind of language.

[0043] According to still a further aspect of the present invention,there is provided a recording medium readable by a computer, therecording medium having a program recorded thereon, the program forenabling the computer to execute a step of determining an original textand a translated version corresponding to the original text in aprocessing-object tagged document, and a step of enabling editing of theoriginal text and the translated version by extracting the original textand the translated version from the tagged document and by displayingthe original text and the translated version in anoriginal-versus-version form.

[0044] The program recorded on this recording medium may enable thecomputer to further execute a step of forming a translated version bytranslating the original text after editing of the original text, and astep of restoring, from the original text after the editing and thetranslated version made by the translation, a tagged document in thesame format as the processing-object tagged document according to tagscontained in the processing-object tagged document, the tags including alanguage tag designating a kind of language.

[0045] According to still a further aspect of the present invention,there is provided a recording medium readable by a computer, therecording medium having a tagged document recorded thereon, the recordedtagged document comprising a first language tag designating a first kindof language; a first text following the first language tag, the firsttext being described in a first language; a second language tagfollowing the first text, the second language tag designating a secondkind of language; and a second text following the second language tag,the second text being described in a second language and correspondingto the first text.

[0046] According to still a further aspect of the present invention,there is provided a recording medium readable by a computer, therecording medium having a tagged document recorded thereon, the recordedtagged document comprising an invisibility start tag for setting afollowing text in an invisible state; a first language tag following theinvisibility start tag, the first language tag designating a first kindof language; a first text following the first language tag, the firsttext being described in a first language; a second language tagfollowing the first text, the second language tag designating a secondkind of language; an invisibility end tag following the second languagetag, the invisibility end tag canceling the invisible state; and asecond text following the invisibility end tag, the second text beingdescribed in a second language and corresponding to the first text.

[0047] According to a further aspect of the present invention, there isprovided a document processing method comprising a step of setting alanguage tag designating a kind of language at each of constituent unitpositions in an original and those in a translated version correspondingto the original; and a step of forming a tagged document including theoriginal and the translated version each having the language tag settherein.

[0048] The above-described document processing method further comprisesa step of setting a visibility control tag for controlling any documentportion so that the document portion is invisible, said visibilitycontrol tag being set at such a position that one of the original andthe translated version is in an invisible state.

[0049] According to still a further aspect of the present invention,there is provided a document processing method comprising a step ofsetting a language tag designating a kind of language at each ofconstituent unit positions in a first text described in a first languageand those in a second text described in a second language of the firsttext; and a step of forming a tagged document including the first textand the second text each having the language tag set therein.

[0050] The above-described document processing method further comprisesa step of setting a visibility control tag for controlling a text sothat the document is invisible, said visibility control tag being set atsuch a position that one of the first text and the second text is in aninvisible state.

[0051] According to still a further aspect of the present invention,there is provided a document processing method comprising a step ofdetermining and extracting an original text from a processing-objecttagged document; and a step of enabling editing of the original textextracted from the tagged document by displaying the original text.

[0052] The above-described document processing method further comprisesa step of making a translated version by translating the original textafter editing of the original text; and a step of restoring, from theoriginal text after the edit processing and the translated versionformed by said translation, a tagged document in the same format as theprocessing-object tagged document according to tags contained in theprocessing-object tagged document, said tags including a language tagdesignating a kind of language.

[0053] According to still a further aspect of the present invention,there is provided a document processing method comprising a step ofdetermining an original text and a translated version corresponding tothe original text in a processing-object tagged document; and a step ofenabling edit processing of the original text and the translated versionby extracting the original text and the translated version from thetagged document and by displaying the original text and the translatedversion in an original-versus-version form.

[0054] The above-described document processing method further comprisesa step of forming a translated version by translating the original textafter editing of the original text; and a step of restoring, from theoriginal text after the edit processing and the translated version madeby said translation, a tagged document in the same format as theprocessing-object tagged document, according to tags contained in theprocessing-object tagged document, said tags including a language tagdesignating a kind of language.

[0055] According to the present invention, it is not necessary to formand manage a file with respect to each of a plurality of languages, sothat the maintenance can be easily carried out.

[0056] According to the present invention, an original text and atranslated version of the original text obtained by translation aredisplayed in an original-versus-version form such that edit processingand version can be easily carried out.

[0057] According to the present invention, an original text in a taggeddocument and a translated version of the original text obtained bytranslation can be selectively displayed.

[0058] Further, according to the present invention, an edited translatedversion of the result of translation can be reformatted to the taggeddocument and used.

[0059] Further, according to the present invention, an original text ina tagged document can be easily edited to obtain a more favorabletranslated version.

BRIEF DESCRIPTION OF THE DRAWINGS

[0060] These objects and advantages of the present invention will becomemore apparent and more readily appreciated from the following detaileddescription of the presently preferred exemplary embodiments, taken inconjunction with the accompanying drawings of which:

[0061]FIG. 1 is a block diagram showing the configuration of an HTMLdocument processing apparatus which represents a first embodiment of thepresent invention;

[0062]FIG. 2 is a diagram schematically showing the format and contentsof an HTML document;

[0063]FIG. 3 is a diagram explaining a state where HTML documents arecombined into one file and stored, and an edited state by a translationeditor;

[0064]FIG. 4 is a flowchart for explaining the operation of the HTMLdocument processing apparatus of the first embodiment;

[0065]FIG. 5 is another flowchart for explaining the operation of theHTML document processing apparatus of the first embodiment;

[0066]FIG. 6 is a diagram showing a specific example of anoriginal-versus-version HTML document source;

[0067]FIG. 7 is a block diagram showing the configuration of an HTMLdocument processing apparatus which represents second and thirdembodiments of the present invention;

[0068]FIG. 8 is a flowchart for explaining the operation of the HTMLdocument processing apparatus of the second embodiment;

[0069]FIG. 9 is another flowchart for explaining the operation of theHTML document processing apparatus of the second embodiment;

[0070]FIG. 10 is a diagram showing an example of an initial window of anEnglish-Japanese translation editor (original-versus-version editorprogram);

[0071]FIG. 11 is a flowchart for explaining the operation of the HTMLdocument processing apparatus of the third embodiment; and

[0072]FIG. 12 is another flowchart for explaining the operation of theHTML document processing apparatus of the third embodiment.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0073] Embodiments of the present invention will be described withreference to the accompanying drawings.

[0074] [First Embodiment]

[0075] [Configuration of HTML Document Processing Apparatus]

[0076]FIG. 1 shows functional components of an HTML document processingapparatus 1 which represents a first embodiment of the presentinvention. As shown in FIG. 1, the HTML document processing apparatus 1is connected to a World Wide Web (WWW) server 3 through an IPcommunication network, i.e., Internet 2. In this embodiment, the WWWserver 3 holds English home pages (Web pages in which contents aredescribed in English).

[0077] The HTML document processing apparatus 1 can be realized by apersonal computer, and has an input section 10, a display section 11, adisplay processing section 12, a communication control section 13, anautomatic translation processing section 14, an original-versus-versionHTML document forming processing section 15, an individual-language HTMLdocument conversion processing section 16, a data storage section 17,etc. The HTML document processing apparatus 1 is connected to a WWWbrowser 4 provided as an Internet display tool.

[0078] The personal computer has, as is well known, a central processingunit (CPU), a random access memory (RAM), a hard disk, a drive unit foraccessing a portable recording medium, such as a floppy disk or acompact disk-read only memory (CD-ROM), to read out a program or datarecorded thereon, a communication control unit, such as a modern or alocal area network (LAN) board, a display, a keyboard, and a mouse.

[0079] The input section 10 of the HTML document processing apparatus 1in this embodiment corresponds to the keyboard and the mouse; thedisplay section 11 to the display; the data storage section 17 to theRAM, the hard disk or the portable recording medium; and thecommunication control section 13 to the communication control unit.

[0080] The display processing section 12, the automatic translationprocessing section 14, the original-versus-version HTML document formingprocessing section 15, and the individual-language HTML documentconversion processing section 16 are realized by making the CPU executea program of the present invention and a program of Internet translationsoftware.

[0081] The program of the present invention and the Internet translationsoftware are provided by being recorded on the portable recordingmedium, and are read out by the drive unit and stored in the hard diskin advance.

[0082] The above-described program and software stored in an externalunit, may be downloaded to the HTML document processing apparatus 1through the network using the communication control unit, and stored inthe hard disk.

[0083] The program of the present invention and the Internet translationsoftware stored in the hard disk are read to the RAM to be executed bythe CPU. The programs of the present invention and the Internettranslation software may be directly read to the RAM without beingstored in the hard disk, instead of being temporality stored in the harddisk and thereafter read to the RAM as described above.

[0084] HTML documents formed by the HTML document processing apparatus 1in this embodiment of the present invention are stored in the RAM, thehard disk or the portable recording medium.

[0085] In the HTML document processing apparatus 1 arranged as describedabove, the automatic translation processing section 14 translates anEnglish HTML document (an original including sentences and figures) ofan English home page described in English into a Japanese HTML document(a translated version) described in Japanese by using the home pagetranslation functions of the Internet translation software, e.g.,“ATLAS” (trademark of FUJITSU LIMITED) and the original and the resultsof the translation are visually displayed so that pairs of original andtranslated paragraphs or sentences are juxtaposed in upper and lowerpositions, as described below in detail.

[0086] In this translation and display process, theoriginal-versus-version HTML document forming processing section 15attaches a language tag, which is a special tag for identification ofthe kind of language, to the beginning of each of paragraphs orsentences which are constituent units of the original document and thetranslated document. The individual-language HTML document conversionprocessing section 16 recognizes the attached language tag to enableselective display of only one of the original and translated documentsby further adding an invisibility tag (visibility control tag) toinstruct to make no visual display.

[0087] [Operation of HTML Document Processing Apparatus]

[0088] The operation of the HTML document processing apparatus 1 in thefirst embodiment of the present invention shown in FIG. 1 will now bedescribed. FIG. 2 schematically shows the compositions and contents oforiginal and translated documents and a state where these documents arestored and displayed by being juxtaposed in the above-mentioned manner.FIG. 3 shows a state where an original HTML document and a version ofthe HTML document are stored and displayed while being combined into onefile in such a form that each of pairs of original and translatedsentences are juxtaposed in left and right positions. FIGS. 4 and 5 areflowcharts for explaining the operation of the HTML document processingapparatus 1.

[0089] (Original-Versus-Version HTML Document Forming Processing)

[0090] The operation will be described by way of example with respect toa case where, in the HTML document processing apparatus 1, an EnglishHTML document (original) of an English home page described in English istranslated into a Japanese HTML document (translated version) describedin Japanese, and an original-versus-version HTML document is formed fromthe original and version and is visually displayed.

[0091] Referring also to FIGS. 1 through 5, a user using the HTMLdocument processing apparatus 1 searches the Internet for a home pagedescribed in a language which he or she wishes to translate, byoperating the input section 10 and based on the contents of a displaymade by the display section 11 (WWW browser). In this example, theobject of this search (processing step S401 in FIG. 4) is an Englishhome page held by the WWW server 3.

[0092] Next, the user initiates a program for translation in the desireddirection by operating the input section 10 and based on the contents ofthe display made by the display section 11. In this example, the userwishes to translate an English HTML document of the English home pageinto a Japanese HTML document described in Japanese and thereforeinitiates an English-into-Japanese translation program (S402).

[0093] The user clicks a “translation” button displayed in an initialwindow (dialogue window) (not shown) of the translation program to makethe HTML document processing apparatus 1 translate and arrange theEnglish HTML document in such a manner that each of pairs of originaland translated paragraphs or sentences can be displayed by beingjuxtaposed in upper and lower positions (S403). According to the user's“translation” button clicking operation, the automatic translationprocessing section 14 having the translation program read out from thehard disk, translates the English HTML document supplied from the WWWserver 3 through the communication control section 13 into a Japanesedocument, and thereby forms an HTML document in which each of a certainnumber of English paragraphs or sentences are followed by a line breaktag and a corresponding Japanese paragraph or sentence, that is, each ofpairs of original and translated paragraphs or sentences are displayedby being juxtaposed in upper and lower positions. Known techniques canbe used for the process of translation in the automatic translationprocessing section 14, and therefore such translation techniques willnot be described in detail in this specification.

[0094] The original-versus-version HTML document forming processingsection 15, supplied with the results of translation performed by theautomatic translation section 14, i.e., the HTML document containingJapanese sentences, starts analyzing the HTML document (S404) to addlanguage tags for enabling discrimination between the original andtranslated paragraphs or sentences in the individual-language HTMLdocument conversion processing section 16 described below in detail.

[0095] The original-versus-version HTML document forming processingsection 15 first recognizes the part of a text of the inputted HTMLdocument. The original-versus-version HTML document forming processingsection 15 detects an HTML tag such as specifically shown in FIG. 6(shown in the space between symbols < and > in FIG. 6) to recognize thesection following this tag as English texts and Japanese texts (S405).

[0096] Next, the original-versus-version HTML document formingprocessing section 15 determines, if the texts recognized in step S405are an English paragraph or sentence (original) or a Japanese paragraphor sentence (translated version) according to the position of theparagraphs or sentences (in the order of an English paragraph orsentence, a line break tag, and a Japanese paragraph or sentence) (stepS406). If the original-versus-version HTML document forming processingsection 15 recognizes an English paragraph or sentence as a result ofthis determination, it embeds a language tag <E> at the beginning of thedocument-component-unit paragraph or sentence (S407). If it recognizes aJapanese paragraph or sentence, it embeds a language tag <J> at thebeginning of the document-component-unit paragraph or sentence (S408).

[0097] The original-versus-version HTML document forming processingsection 15 proceeds to processing of the next paragraph in theoriginal-versus-version HTML document, and makes a determination as towhether an HTML tag </html> indicating the end of the HTML document isdetected or not (steps S409, S410). The original-versus-version HTMLdocument forming processing section 15 repeats the above-described stepsS404 to S410 until the HTML tag indicating the end of the HTML documentis detected.

[0098] The original-versus-version HTML document forming processingsection 15 stores in the data storage section 17 anoriginal-versus-version HTML document obtained as a result of theoriginal-versus-version HTML document forming processing based on theabove-described sequence of document analysis steps, when the HTML tagindicating the end of the HTML document is detected. The state of datathereby stored is shown as an original-versus-version HTML documentsource 21 in FIG. 2. In FIG. 2, symbols Δ, and ∇ indicate Englishsentences and symbols ◯, and ⊚ indicate Japanese sentences.

[0099] After the original-versus-version HTML document formingprocessing performed by the original-versus-version HTML documentforming processing section 15 based on the above-described sequence ofdocument analysis steps, the display processing section 12 visuallydisplays the stored original-versus-version HTML document in theupper-lower juxtaposition form on the screen of the display section 11(WWW browser 4) (step S411).

[0100] The original-versus-version HTML document source 21 is displayedin the upper-lower juxtaposition form, as indicated by referencenumerals 20 in FIG. 2. In FIG. 2, “image1” and “image2” representfigures in the document.

[0101] (Processing for Selectively Displaying English HTML Document(Original) and Japanese HTML Document (Version))

[0102] In the HTML document processing apparatus, processing describedbelow is performed after the original-versus-version HTML documentforming processing based on the above-described sequence of documentanalysis steps. The processing described below enables selective displayof only one of the English HTML document (original) and the JapaneseHTML document (translated version) displayed in the upper-lowerjuxtaposition form on the screen of the display section 11.

[0103] The user determines display of the document in the particularlanguage for selective display of the document, and clicks a buttoncorresponding to the language to be displayed (a “Japanese” button inthis example) (S412 in FIG. 5). In response to the user's “Japanese”button clicking operation, the HTML document processing apparatus 1hands over processing to the individual-language HTML documentconversion processing section 16 to enable selective display of only theEnglish HTML document (original) or the Japanese HTML document(translated version). In the case where the objective languages areEnglish and Japanese as in this example, “English” and “Japanese”buttons (not shown) are displayed.

[0104] The individual-language HTML document conversion processingsection 16 reads out the original-versus-version HTML document source 21to the data storage section 17 as shown in FIG. 2, and starts analyzingthe document (S413). The individual-language HTML document conversionprocessing section 16 makes a determination as to whether or not alanguage tag <E> or <J> embedded in the original-versus-version HTMLdocument source 21 has been detected (S414).

[0105] If the individual-language HTML document conversion processingsection 16 has detected a language tag in the Englishoriginal-vs.-Japanese version HTML document, it sets an invisibilitystart tag <INVISIBILITY START> as a visibility control tag immediatelybefore the detected language tag (S415).

[0106] Thereafter, to enable visual display of only the Japanesesentences in the English original-vs.-Japanese version HTML document,the individual-language HTML document conversion processing section 16makes a determination as to whether or not the language tag <J> isdetected (S416). If the individual-language HTML document conversionprocessing section 16 has detected a language tag <J>, it sets aninvisibility end tag <INVISIBILITY END> as a visibility control tagimmediately after the detected tag (immediately before a translatedsentence) (S417).

[0107] The well-known comment tag may be used as the above-mentionedvisibility control tag. A sentence or paragraph bracketed by a pair ofcomment tags is not displayed by the WWW browser 4.

[0108] If the individual-language HTML document conversion processingsection 16 recognizes the detected language tag as a language tag otherthan the language tag <J> in above step S416, it sets an invisibilityend tag <INVISIBILITY END> at the end of the paragraph (S418). Thisprocess is effective in processing an original-versus-version HTMLdocument containing sentences written in three or more languages. Forexample, if the original-versus-version HTML document contains Germansentences (original) together with English sentences (original) andJapanese sentences (translated version), this process makes the Germansentences invisible as well as the English sentences.

[0109] An original-versus-version HTML document 22 having invisibilitytags <INVISIBILITY START>, <INVISIBILITY END> as visibility control tagsadded thereto as well as language tags <E> and <J>, as shown in FIG. 2,is formed in this manner to display Japanese sentences only. FIG. 2shows this HTML document with source codes.

[0110] If no language tag is detected in above step S414, or after thecompletion of processing of the step S417 or 418, theindividual-language HTML document conversion processing section 16proceeds to processing of the next paragraph in theoriginal-versus-version HTML document, and makes a determination as towhether or not an HTML tag </html> indicating the end of the HTMLdocument has been detected (steps S419, S420). The individual-languageHTML document conversion processing section 16 repeats theabove-described steps S413 to S420 until the HTML tag indicating the endof the HTML document is detected.

[0111] The individual-language HTML document conversion processingsection 16 stores in the data storage section 17 the HTML document withinvisibility tags obtained as a result of the above-described sequenceof process steps for selective display of the original-versus-versionHTML document when the HTML tag indicating the end of theoriginal-versus-version HTML document is detected.

[0112] The display processing section 12 then opens the HTML documentwith invisibility tags by the WWW browser 4, and visually displays thedocument on the screen of the display section 11 (S421). A Japanesesentence HTML display 23 shown in FIG. 2 represents the state of thisdisplay on the screen.

[0113] Further, FIG. 2 shows the source code for enabling display ofEnglish sentences only in an original-versus-version HTML document 24,and shows an HTML display 25 which represents the state of display onthe screen when the HTML document 24 is displayed by the WWW browser 4.

[0114] [Second Embodiment]

[0115] [Configuration of HTML Document Processing Apparatus]

[0116]FIG. 7 shows functional components of an HTML document processingapparatus 1 which represents a Embodiment 2 of the present invention.The HTML document processing apparatus 1 is connected to a World WideWeb (WWW) server 3 through an IP communication network, i.e., Internet2. In this embodiment, the WWW server 3 holds English home pages.

[0117] The HTML document processing apparatus 1 can be realized by apersonal computer, and has an input section 10, a display section 11, adisplay processing section 12, a communication control section 13, anautomatic translation processing section 14, an original-versus-versionHTML document forming processing section 15, an individual-language HTMLdocument conversion processing section 16, a data storage section 17, anHTML document analysis and conversion processing section 18, anEnglish-Japanese translation editor (original-versus-version editorprogram) 19, an original-versus-version HTML document conversionprocessing section 40, etc. The HTML document processing apparatus 1 isconnected to a WWW browser 4 provided as an Internet display tool.

[0118] In the HTML document processing apparatus arranged as describedabove, the automatic translation processing section 14 translates anEnglish HTML document (an original including sentences and figures) ofan English home page described in English into a Japanese HTML document(a translated version) described in Japanese, by using the home pagetranslation functions of the above-mentioned Internet translationsoftware, and the original and the results of the translation arevisually displayed in such a form that each of pairs of original andtranslated paragraphs or sentences are juxtaposed in upper and lowerpositions.

[0119] In this translation and display process, theoriginal-versus-version HTML document forming processing section 15attaches a language tag, which is a special tag for determination of thekind of language, to the beginning of each of paragraphs or sentenceswhich are constituent units of the original document and the translateddocument.

[0120] The HTML document analysis and conversion processing section 18and the English-Japanese translation editor (original-versus-versioneditor program) 19 extract data with respect to each language, when datais edited in an original-versus-version form. After editing oforiginal-versus-version data performed by the editor, HTML tags arerestored to form an HTML document from the edited data.

[0121] If an HTML document is translated by Internet translationsoftware, translation results outputted in an original-versus-versionform cannot be edited. However, document data prepared by removing HTMLtags from the translation results is supplied to the editor capable ofediting data in an original-versus-version form to enable editing of thetranslation results. Also, HTML tags are restored in edit results toenable the edit results to be used as an HTML document.

[0122] [Operation of HTML Document Processing Apparatus]

[0123] The operation of the HTML document processing apparatus 1 inEmbodiment 2 of the present invention shown in FIG. 7 will now bedescribed. FIGS. 8 and 9 are flowcharts for explaining the operation ofthe HTML document processing apparatus 1.

[0124] (Original-Versus-Version HTML Document Forming Processing)

[0125] Referring to FIGS. 7 through 9, in the HTML document processingapparatus 1, an English HTML document (original) of an English home pagedescribed in English is translated, an HTML document including theformed original and translated version is analyzed, and anoriginal-versus-version HTML document having language tags embeddedtherein is formed and visually displayed. This document forming anddisplaying processing is the same as that performed by the HTML documentprocessing apparatus 1 Embodiment 1 described above, i.e., the processsteps S401 to S414 shown in FIG. 4.

[0126] (Original-Versus-Version HTML Document Data Conversion Processingand Translated Version Edit Processing)

[0127] In the HTML document processing apparatus 1, after theabove-described processing of forming an original-versus-version HTMLdocument displayed in an upper-lower juxtaposition form on the screen ofthe display section 11, data conversion processing and translatedversion edit processing can be performed in a process described below.

[0128] When a user clicks an “original-versus-version edit processingstart” button, the automatic translation processing section 14 havingtranslation software hands over processing to the HTML document analysisand conversion processing section 18 (S812 in FIG. 8) to enable editingof the Japanese HTML document (version) in the original-versus-versionHTML document 20 shown in FIG. 2.

[0129] The HTML document analysis and conversion processing section 18starts processing for enabling editing using the English-Japanesetranslation editor (original-versus-version editor program) 19 byremoving language tags from the original-versus-version HTML documentsource 21 having language tags <E> and <J> added thereto (S813).

[0130] First, the HTML document analysis and conversion processingsection 18 makes a determination as to whether a language tag <E> or <J>embedded in the original-versus-version HTML document in theoriginal-versus-version HTML document forming processing section 15 hasbeen detected (S814).

[0131] If the HTML document analysis and conversion processing section18 has detected a language tag <E> or <J> in the Englishoriginal-versus-Japanese version HTML document, it stores in the datastorage section 17 language data together with language tag start andend addresses (S815).

[0132] Thereafter, to enable display and edit of English sentences inthe English original-vs.-Japanese version HTML document, the HTMLdocument analysis and conversion processing section 18 makes adetermination as to whether or not the detected language tag is alanguage tag <E> (S816). If the detected language tag is a language tag<E>, the HTML document analysis and conversion processing section 18stores in an original document storage area in the data storage section17, the start address of a document constituent unit paragraph (orsentence) and the contents of the paragraph (S817).

[0133] If the result of above determination in step S816 detects alanguage tag <J>, the HTML document analysis and conversion processingsection 18 stores in a version storage area in the data storage section17, the start address of a document constituent unit paragraph (orsentence) and the contents of the paragraph (S818) to enable Japanesesentence display and edit. As a result of the above process steps S815to S818, English and Japanese paragraphs or sentences from which HTMLtags have been removed and the start and end addresses of the paragraphsor sentences are stored in the data storage section 17.

[0134] If no language tag is detected in above step S814, or after thecompletion of processing in above step S817 or 818, the HTML documentanalysis and conversion processing section 18 proceeds to processing ofthe next paragraph in the original-versus-version HTML document 21, andmakes a determination as to whether or not an HTML tag </html>indicating the end of the HTML document is detected (steps S819, S820).

[0135] The HTML document analysis and conversion processing section 18repeats the above-described steps S813 to S820 in cooperation with theEnglish-Japanese translation editor (original-versus-version editorprogram) 19, until the HTML tag indicating the end of the HTML documentis detected. FIG. 10 shows an example of an initial window of theEnglish-Japanese translation editor.

[0136] When the HTML tag indicating the end of theoriginal-versus-version HTML document is detected, the HTML documentanalysis and conversion processing section 18 visually displays (S821),in the window of the English-Japanese translation editor (displaysection 11), the results of the above-described sequence of processsteps for converting the original-versus-version HTML document data,i.e., English paragraphs or sentences (original) and Japanese paragraphsor sentences (translated version) from which the language tags and theHTML tags have been removed.

[0137] The user edits the Japanese sentences according to his or herneed on the basis of the English and Japanese paragraphs or sentencesvisually displayed in the left-right juxtaposition form in the window ofthe English-Japanese translation editor. The symbol  in the translatedsentences shown in FIG. 3 designates edited portions. When the userclicks an “end” button in the English-Japanese translation editor windowafter suitably editing the Japanese version, the English-Japanesetranslation editor (original-versus-version editor program) 19 storesthe results of editing in the data storage section 17, therebycompleting the edit processing (S822).

[0138] (Processing for Restoration to HTML Document Format)

[0139] In the HTML document processing apparatus 1, after theabove-described original-versus-version HTML document data conversionprocessing and translated version editing processing, a processdescribed below is further performed to enable restoration to the HTMLdocument format.

[0140] When the user clicks an “HTML display” button in theEnglish-Japanese translation editor window, the English-Japanesetranslation editor (original-versus-version editor program) 19 isstarted (S823). By an instruction from the English-Japanese translationeditor 19, the original-versus-version HTML document conversionprocessing section 40 starts analyzing the edited document (S824).

[0141] First, the original-versus-version HTML document conversionprocessing section 40 reads out one of the English paragraphs orsentences (original) or the Japanese paragraphs or sentences (translatedversion) stored in the data storage section 17, and makes adetermination as to whether the document constituent unit, i.e., theparagraph or sentence, is English (original) or Japanese (translatedversion) (S825). This determination can be definitely made based on thelanguage tags stored in the data storage section 17.

[0142] If the result of this determination is that the paragraph orsentence read out is English, the original-versus-version HTML documentconversion processing section 40 adds the language tag <E> to thebeginning of the paragraph (S826). If the paragraph read out isJapanese, the original-versus-version HTML document conversionprocessing section 40 adds the language tag <J> to the beginning of theparagraph (S827). The original-versus-version HTML document conversionprocessing section 40 substitutes (S828) the paragraph of the editeddocument for the corresponding paragraph of the original-versus-versionHTML document before editing by referring to the paragraph start addressstored in the data storage section 17.

[0143] The original-versus-version HTML document conversion processingsection 40 then proceeds to processing of the next paragraph and makes adetermination as to whether or not the end of the edited document hasbeen reached (steps S829, S830). The original-versus-version HTMLdocument conversion processing section 40 repeats the above-describedsteps S824 to S830 until the end of the edited document is reached.

[0144] The original-versus-version HTML document conversion processingsection 40 stores in the data storage section 17 theoriginal-versus-version HTML document obtained as a result of theabove-described sequence of HTML document restoration processing steps.The display processing section 12 opens, by the WWW browser 4, theoriginal-versus-version HTML document stored in the data storage section17, thereby making a display reflecting the editing results (S831).

[0145] When the original-versus-version HTML document is displayed onthe editor, the English paragraphs or sentences and the positions of thesame, and the Japanese paragraphs or sentences and the positions of thesame in the original-versus-version HTML document are stored in therespective storage areas in the data storage section 17, and theoriginal-versus-version HTML document before editing is also held by thedata storage section 17. From the information on the positions of theEnglish and Japanese paragraphs or sentences stored in the storageareas, the other portions can be identified as the tags. In this manner,the positions of the tags can be obtained reciprocally from thepositions of the English and Japanese paragraphs or sentences, and HTMLdocument restoration processing can be performed by using the tags inthe original-versus-version HTML document before editing with referenceto the tag position information obtained from the paragraph or sentencepositions.

[0146] The formed original-versus-version HTML document can be displayedin the form of an original-versus-version home page by the Internetdisplay tool WWW browser 4 if it may formed without invisibility tags.Selective display processing (see FIG. 5) of the original-versus-versionHTML document may be performed by the individual-language HTML documentconversion processing section 16 if necessary, as in the HTML documentprocessing apparatus 1 in above-described Embodiment 1, thereby enablingonly the Japanese HTML document (version) obtained by this processing tobe visually displayed by the display section 11.

[0147] [Embodiment 3]

[0148] [Configuration of HTML Document Processing Apparatus]

[0149] An HTML document processing apparatus 1 which representsEmbodiment 3 of the present invention has basically the same functionsand configuration as the HTML document processing apparatus 1 ofEmbodiment 2 shown in FIG. 7.

[0150] In the HTML document processing apparatus 1 in the thirdembodiment, however, there is no need for the automatic translationprocessing section 14 and the original-versus-version HTML documentforming processing section 15.

[0151] In this HTML document processing apparatus 1, the HTML documentanalysis and conversion processing section 18 and the English-Japanesetranslation editor (original-versus-version editor program) 19 cooperateto perform data conversion by removing HTML tags from an English HTMLdocument (original) of an English home page described in English,thereby enabling editing of the original and inputting of a translatedversion. A user edits the editable English sentences according to his orher need and inputs Japanese sentences (translated version).

[0152] Also, the original-versus-version HTML document conversionprocessing section 40 substitutes edited English and Japanese sentencesfor the original English sentences, restores the removed HTML tags, andsets translation-result Japanese sentences (translated version) incorrespondence with the edited sentences, thereby enabling visualdisplay of an original-versus-version HTML document containing a morefavorable version.

[0153] [Operation of HTML Document Processing Apparatus]

[0154] (Data Conversion Processing and Original Edit Processing ofEnglish HTML Document (Original))

[0155]FIG. 7 shows the configuration of this HTML document processingapparatus 1, and FIGS. 11 and 12 are flowcharts for explaining theoperation of this HTML document processing apparatus 1. Referring toFIGS. 7, 11, and 12, a user using the HTML document processing apparatus1 searches the Internet for a home page described in a language which heor she wishes to translate, by operating the input section 10 and basedon the contents of a display made by the display section 11. In thisexample, the object of this search (processing step S1101 in FIG. 11) isan English home page held by the WWW server 3.

[0156] Next, the user initiates a program for translation in the desireddirection by operating the input section 10 and based on the contents ofthe display made by the display section 11. In this example, the Englishhome page is to be finally translated into a Japanese HTML documentdescribed in Japanese. Therefore, an English-into-Japanese translationprogram is started (S1102).

[0157] The user clicks an “original-versus-version edit processingstart” button in an initial window (dialogue window) (not shown) of thetranslation program to start the English-Japanese translation editor(original-versus-version editor program) 19 (S1103).

[0158] The HTML document analysis and conversion processing section 18starts analyzing (S1104) the English HTML document (original) suppliedfrom the WWW server 3 through the communication control section 13.First, the HTML document analysis and conversion processing section 18recognizes a text portion of the English HTML document The HTML documentanalysis and conversion processing section 18 detects an HTML tag torecognize the section following this tag as an English text (S1105).

[0159] Next, the HTML document analysis and conversion processingsection 18 stores in the original storage area in the data storagesection 17 the start address of a document constituent unit paragraph(or sentence) and the contents of the paragraph (S1106). As a result ofthe processing in the above steps S1105 and S1106, the English paragraphor sentence from which the HTML tag has been removed is stored in thedata storage section 17.

[0160] If HTML tag is detected in above step S1105, or after thecompletion of processing in the step S1106, the HTML document analysisand conversion processing section 18 proceeds to processing of the nextparagraph in the English HTML document, and makes a determination as towhether or not an HTML tag </html> indicating the end of the HTMLdocument is detected (steps S1107, S1108).

[0161] The HTML document analysis and conversion processing section 18repeats the above-described steps S1104 to S1108 in cooperation with theEnglish-Japanese translation editor (original-versus-version editorprogram) 19 until the HTML tag indicating the end of the English HTMLdocument is detected.

[0162] When the HTML tag indicating the end of the English HTML documentis detected, the HTML document analysis and conversion processingsection 18 visually displays (S1109) in a left-hand section of thewindow of the English-Japanese translation editor (display section 11),the results of the above-described sequence of process steps forconverting the English HTML document data.

[0163] The user performs manual translation on the basis of the Englishparagraphs or sentences (original) visually displayed in the window ofthe English-Japanese translation editor, and forms translated sentences(Japanese sentences) by operating the input section 10. In thistranslation processing, the user edits the English sentences and, ifnecessary the Japanese translated sentences according to his or herneed. When the user clicks an “end” button in the English-Japanesetranslation editor window after suitably editing the original andcompleting the Japanese version, the English-Japanese translation editor(original-versus-version editor program) 19 stores the results ofediting and translation in the data storage section 17, therebycompleting the processing (S1110).

[0164] (Processing for Restoration to HTML Document Format)

[0165] In the HTML document processing apparatus 1, after theabove-described English HTML document (original) data conversionprocessing and original editing processing, a process described below isnext performed to enable restoration processing of the HTML documentformat.

[0166] When the user clicks an “HTML display” button in theEnglish-Japanese translation editor window, the English-Japanesetranslation editor (original-versus-version editor program) 19 is againstarted (S1111). By an instruction from the English-Japanese translationeditor 19, the original-versus-version HTML document conversionprocessing section 40 starts analyzing the resulting edited document(S1112).

[0167] First, the original-versus-version HTML document conversionprocessing section 40 reads out one of the English paragraphs orsentences (original) stored in the data storage section 17, and makes adetermination as to whether or not the document constituent unit, i.e.,the paragraph (or sentence), is English (original) (S1113). Thisdetermination can be definitely made based on the storage positionsstored in the left-right juxtaposition form in the data storage section17.

[0168] If the result of this determination is that the paragraph orsentence read out is English, the original-versus-version HTML documentconversion processing section 40 adds the language tag <E> to thebeginning of the paragraph (S1114). If the paragraph read out isJapanese, the original-versus-version HTML document conversionprocessing section 40 adds the language tag <J> to the beginning of theparagraph (S1115).

[0169] The original-versus-version HTML document conversion processingsection 40 substitutes (S1116) the paragraph of the resulting editedoriginal with the language tag added thereto for the correspondingoriginal paragraph of the HTML document before editing, by referring tothe paragraph start address stored in the data storage section 17, andinserts the translated sentence formed in the above-described step S1110immediately after the substituted original paragraph (S1117).

[0170] The original-versus-version HTML document conversion processingsection 40 then proceeds to processing of the next paragraph and makes adetermination as to whether or not the end of the edited document hasbeen reached (steps S1118, S1119). The original-versus-version HTMLdocument conversion processing section 40 repeats the above-describedsteps S1112 to S1119 until the end of the edited document is reached.

[0171] The original-versus-version HTML document conversion processingsection 40 stores in the data storage section 17 theoriginal-versus-version HTML document obtained as a result of theabove-described sequence of HTML document restoration processing steps.The display processing section 12 then opens, by the WWW browser 4, theoriginal-versus-version HTML document stored in the data storage section17, thereby making a display reflecting the editing results (S1120).

[0172] Also in this embodiment, HTML document restoration processing canbe performed by using the tags in the original-versus-version HTMLdocument in the same manner as that in Embodiment 2.

[0173] Also, selective display processing (see FIG. 5) of theoriginal-versus-version HTML document may be performed by theindividual-language HTML document conversion processing section 16 ifnecessary, as in the HTML document processing apparatus 1 in theabove-described Embodiment 1, thereby enabling only the Japanese HTMLdocument (translated version) obtained by this processing to be visuallydisplayed in the display section 11.

[0174] [Examples of Modification]

[0175] In the HTML document processing apparatus 1 in theabove-described third embodiment, translated sentences are formed bymanual translation. However, an automatic translation process using theautomatic translation processing section 14 may also be performed in thethird embodiment, similar to that in the HTML document processingapparatuses in the first and second embodiments. Conversely, manualtranslation may be performed in the HTML document processing apparatusesin the first and second embodiments.

[0176] In each of the HTML document processing apparatuses in theabove-described embodiments, an original written in English, istranslated into Japanese. However, the HTML document processingapparatus of the present invention can also operate in the same mannerwith respect to the respective languages, and can similarly process anHTML document having sentences written in three or more languages.

[0177] In the HTML document processing apparatus 1 in the firstembodiment, invisibility tags designating inhibition of visual displayare set as visibility control tags in an HTML document. However, it isalso possible to print only a document expressed in a particularlanguage selected by setting such tags.

[0178] Each of the above-described processes according to the presentinvention can be applied in association with a computer-readable medium.

[0179] Although only a few embodiments of the present invention havebeen described in detail above, those skilled in the art will readilyappreciate that many modifications are possible in the preferredembodiments without departing from the novel teachings and advantages ofthe present invention. Accordingly, all such modifications are intendedto be included within the scope of the present invention as defined bythe following claims.

What is claimed is:
 1. A document processing apparatus comprising: alanguage tag setting unit setting a language tag designating a kind oflanguage at each of constituent unit positions in an original and thosein a translated version corresponding to the original; and a documentforming unit forming a tagged document including the original and thetranslated version each having the language tag set therein.
 2. Adocument processing apparatus according to claim 1, wherein saiddocument forming unit forms a tagged document in which the original andthe translated version each having the language tag set therein aredescribed in an original-versus-version form.
 3. A document processingapparatus according to claim 1 or 2, further comprising a translationunit translating the original to make the translated version.
 4. Adocument processing apparatus according to any one of claims 1 to 3,wherein the original is contained in the processing-object taggeddocument.
 5. A document processing apparatus according to any one ofclaims 1 to 4, further comprising a visibility control tag setting unitsetting a visibility control tag for controlling any document portion sothat the document portion is invisible, said visibility control tagsetting unit setting the visibility control tag at such a position thatone of the original and the translated version is in an invisible state.6. A document processing apparatus according to claim 5, wherein saidvisibility control tag setting unit setting the visibility control tagat such a position that one of the original and the translated versionis set in an invisible state and the language tag is also set in aninvisible state.
 7. A document processing apparatus according to claim1, further comprising a display processing unit interpreting the tagcontained in the tagged document and for enabling a browser fordisplaying the tagged document to display the formed tagged document onthe basis of a result of the interpretation.
 8. A document processingapparatus comprising: a language tag setting unit setting a language tagat each of constituent unit positions in a first text described in afirst language and those in a second text described in a second languageof the first text; and a document forming unit forming a tagged documentincluding the first text and the second text each having the languagetag set therein.
 9. A document processing apparatus according to claim8, wherein said document forming unit forms a tagged document in whichthe first text and the corresponding second text each having thelanguage tag set therein are described by being related to each other.10. A document processing apparatus according to claim 8, furthercomprising a visibility control tag setting unit setting a visibilitycontrol tag for controlling a text so that the text is invisible, saidvisibility control tag setting unit setting the visibility control tagat such a position that one of the first text and the second text is inan invisible state.
 11. A document processing apparatus according toclaim 10, wherein said visibility control tag setting unit sets thevisibility control tag at such a position that one of the first text andthe second text is set in an invisible state and the language tag isalso set in an invisible state.
 12. A document processing apparatusaccording to claim 8, further comprising a display processing unitinterpreting the tag contained in the tagged document and for enabling abrowser for displaying the tagged document to display the formed taggeddocument on the basis of a result of the interpretation.
 13. A documentprocessing apparatus comprising: an analysis unit determining andextracting an original text from a processing-object tagged document;and an editing unit enabling edit processing of the original textextracted from the tagged document by displaying the original text. 14.A document processing apparatus according to claim 13, wherein saidanalysis unit determines the original text according to a language tagcontained in the processing-object tagged document.
 15. A documentprocessing apparatus according to claim 13 or 14, further comprising atranslation processing unit making a translated version by translatingthe original text edited by said editing unit.
 16. A document processingapparatus according to claim 15, further comprising a restoration unitrestoring, from the original text after the editing and the translatedversion made by said translation processing unit, a tagged document inthe same format as the processing-object tagged document, according totags contained in the processing-object tagged document, said tagsincluding the language tag.
 17. A document processing apparatuscomprising: an analysis unit determining an original text and atranslated version corresponding to the original text in aprocessing-object tagged document; and an editing unit enabling editingof the original text and the translated version by extracting theoriginal text and the translated version from the tagged document and bydisplaying the original text and the translated version in anoriginal-versus-version form.
 18. A document processing apparatusaccording to claim 17, wherein said analysis unit determines theoriginal text and the translated version according to language tagscontained in the processing-object tagged document.
 19. A documentprocessing apparatus according to claim 17 or 18, further comprising atranslation processing unit making a translated version by translatingthe original text edited by said editing unit.
 20. A document processingapparatus according to claim 19, further comprising a restoration unitrestoring, from the original text after the edit processing and thetranslated version made by said translation processing unit, a taggeddocument in the same format as the processing-object tagged document,according to tags contained in the processing-object tagged document,said tags including the language tag.
 21. A recording medium readable bya computer, said recording medium having a program recorded thereon,said program for enabling the computer to execute steps of: setting alanguage tag designating a kind of language at each of constituent unitpositions in an original and those in a translated version correspondingto the original; and forming a tagged document including the originaland the translated version each having the language tag set therein. 22.A recording medium according to claim 21, wherein said program recordedon said recording medium enables the computer to further execute a stepof setting a visibility control tag for controlling any document portionso that the document portion is invisible, said visibility control tagbeing set at such a position that one of the original and the translatedversion is in an invisible state.
 23. A recording medium readable by acomputer, said recording medium having a program recorded thereon, saidprogram for enabling the computer to execute steps of: setting alanguage tag designating a kind of language at each of constituent unitpositions in a first text described in a first language and those in asecond text described in a second language of the first text; andforming a tagged document including the first text and the second texteach having the language tag set therein.
 24. A recording mediumreadable by a computer according to claim 23, wherein said programrecorded on said recording medium enables the computer to furtherexecute a step of setting a visibility control tag for controlling atext so that the document is invisible, said visibility control tagbeing set at such a position that one of the first text and the secondtext is in an invisible state.
 25. A recording medium readable by acomputer, said recording medium having a program recorded thereon, saidprogram for enabling the computer to execute steps of: determining andextracting an original text from a processing-object tagged document;and enabling editing of the original text extracted from the taggeddocument by displaying the original text.
 26. A recording mediumreadable by a computer according to claim 25, wherein said programrecorded on said recording medium enables the computer to furtherexecute steps of: making a translated version by translating theoriginal text after editing of the original text; and restoring, fromthe original text after the edit processing and the translated versionformed by said translation, a tagged document in the same format as theprocessing-object tagged document according to tags contained in theprocessing-object tagged document, said tags including a language tagdesignating a kind of language.
 27. A recording medium readable by acomputer, said recording medium having a program recorded thereon, saidprogram for enabling the computer to execute steps of: determining anoriginal text and a translated version corresponding to the originaltext in a processing-object tagged document; and enabling editprocessing of the original text and the translated version by extractingthe original text and the translated version from the tagged documentand by displaying the original text and the translated version in anoriginal-versus-version form.
 28. A recording medium readable by acomputer according to claim 27, wherein said program recorded on saidrecording medium enables the computer to further execute steps of:forming a translated version by translating the original text afterediting of the original text; and restoring, from the original textafter the edit processing and the translated version made by saidtranslation, a tagged document in the same format as theprocessing-object tagged document, according to tags contained in theprocessing-object tagged document, said tags including a language tagdesignating a kind of language.
 29. A recording medium readable by acomputer, said recording medium having a tagged document recordedthereon, wherein said recorded tagged document comprising: a firstlanguage tag designating a first kind of language; a first textfollowing said first language tag, said first text being described in afirst language; a second language tag following said first text, saidsecond language tag designating a second kind of language; and a secondtext following said second language tag, said second text beingdescribed in a second language, said second text corresponding to saidfirst text.
 30. A recording medium readable by a computer, saidrecording medium having a tagged document recorded thereon, wherein saidrecorded tagged document comprising: an invisibility start tag forsetting a following text in an invisible state; a first language tagfollowing said invisibility start tag, said first language tagdesignating a first kind of language; a first text following said firstlanguage tag, said first text being described in a first language; asecond language tag following said first text, said second language tagdesignating a second kind of language; an invisibility end tag followingsaid second language tag, said invisibility end tag canceling theinvisible state; and a second text following said invisibility end tag,said second text being described in a second language, said second textcorresponding to said first text.
 31. A document processing methodcomprising steps of: setting a language tag designating a kind oflanguage at each of constituent unit positions in an original and thosein a translated version corresponding to the original; and forming atagged document including the original and the translated version eachhaving the language tag set therein.
 32. A document processing methodaccording to claim 31, wherein further comprising a step of setting avisibility control tag for controlling any document portion so that thedocument portion is invisible, said visibility control tag being set atsuch a position that one of the original and the translated version isin an invisible state.
 33. A document processing method comprising stepsof: setting a language tag designating a kind of language at each ofconstituent unit positions in a first text described in a first languageand those in a second text described in a second language of the firsttext; and forming a tagged document including the first text and thesecond text each having the language tag set therein.
 34. A documentprocessing method according to claim 33, wherein further comprising astep of setting a visibility control tag for controlling a text so thatthe document is invisible, said visibility control tag being set at sucha position that one of the first text and the second text is in aninvisible state.
 35. A document processing method comprising steps of:determining and extracting an original text from a processing-objecttagged document; and enabling editing of the original text extractedfrom the tagged document by displaying the original text.
 36. A documentprocessing method according to claim 35, wherein further comprisingsteps of: making a translated version by translating the original textafter editing of the original text; and restoring, from the originaltext after the edit processing and the translated version formed by saidtranslation, a tagged document in the same format as theprocessing-object tagged document according to tags contained in theprocessing-object tagged document, said tags including a language tagdesignating a kind of language.
 37. A document processing methodcomprising steps of: determining an original text and a translatedversion corresponding to the original text in a processing-object taggeddocument; and enabling edit processing of the original text and thetranslated version by extracting the original text and the translatedversion from the tagged document and by displaying the original text andthe translated version in an original-versus-version form.
 38. Adocument processing method according to claim 37, wherein furthercomprising steps of: forming a translated version by translating theoriginal text after editing of the original text; and restoring, fromthe original text after the edit processing and the translated versionmade by said translation, a tagged document in the same format as theprocessing-object tagged document, according to tags contained in theprocessing-object tagged document, said tags including a language tagdesignating a kind of language.